Small Sample Statistics for Classi cation Error Rates
نویسنده
چکیده
Several techniques for estimating the reliability of estimated error rates and for estimating the signicance of observed dierences in error rates are explored in this paper. Textbook formulas which assume a large test set, i.e., a normal distribution, are commonly used to approximate the condence limits of error rates or as an approximate signicance test for comparing error rates. Expressions for determining more exact limits and signicance levels for small samples are given here, and criteria are also given for determining when these more exact methods should be used. The assumed normal distribution gives a poor approximation to the condence interval in most cases, but is usually useful for signicance tests when the proper mean and variance expressions are used. A commonly used 62 signicance test uses an improper expression for , which is too low and leads to a high likelihood of Type I errors. Common machine learning methods for estimating signicance from observations on a single sample may be unreliable.
منابع مشابه
Bias in land cover change estimates due to misregistration
Land cover change may be overestimated due to positional error in multi-temporal images. To assess the potential magnitude of this bias, we introduced random positional error to identical classi ed images and then subtracted them. False land cover change ranged from less than 5% for a 5-class AVHRR classi cation, to more than 33% for a 20-class Landsat TM classi cation. The potential for fal...
متن کاملDouble-bagging: combining classifiers by bootstrap aggregation
The combination of classi"ers leads to substantial reduction of misclassi"cation error in a wide range of applications and benchmark problems. We suggest using an out-of-bag sample for combining di0erent classi"ers. In our setup, a linear discriminant analysis is performed using the observations in the out-of-bag sample, and the corresponding discriminant variables computed for the observations...
متن کاملSmall Sample Statistics for Classi cation Error Rates I : Error Rate Measurements
Several methods (independent subsamples, leave-one-out, cross-validation, and bootstrapping) have been proposed for estimating the error rates of classiers. The rationale behind the various estimators and the causes of the sometimes con BLOCKINicting claims regarding their bias and precision are explored in this paper. The biases and variances of each of the estimators are examined empirically....
متن کاملClustering financial time series: an application to mutual funds style analysis
Classi#cation can be useful in giving a synthetic and informative description of contexts characterized by high degrees of complexity. Di3erent approaches could be adopted to tackle the classi#cation problem: statistical tools may contribute to increase the degree of con#dence in the classi#cation scheme. A classi#cation algorithm for mutual funds style analysis is proposed, which combines di3e...
متن کاملThe Error-reject Tradeoff
We investigate the error versus reject tradeo for classi ers. Our analysis is motivated by the remarkable similarity in error-reject tradeo curves for widely di ering algorithms classifying handwritten characters. We present the data in a new scaled version that makes this universal character particularly evident. Based on Chow's theory of the error-reject tradeo and its underlying Bayesian ana...
متن کامل